Learning Semantic Graph Mapping for Document Summarization
Authors
Abstract
We present a method for summarizing a document by creating a semantic graph of the original document and identifying the substructure of that graph that can be used to extract sentences for a document summary. We start with deep syntactic analysis of the text and, for each sentence, extract logical form triples of the form subject–predicate–object. We then apply cross-sentence pronoun resolution, co-reference resolution, and semantic normalization to refine the set of triples and merge them into a semantic graph. This procedure is applied to both the documents and the corresponding summary extracts. We train a Support Vector Machine on the logical form triples to learn how to create document summaries automatically. Our experiments with the DUC 2002 data show that extending the set of attributes to include semantic properties and topological graph properties of the logical form triples yields a statistically significant improvement in the F1 measure for the extracted summaries.
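The graph-building step described in the abstract can be illustrated with a minimal Python sketch. It assumes the triples have already been extracted and normalized (parsing, pronoun resolution, and co-reference resolution are out of scope here), and it uses plain node degrees as a stand-in for the topological graph properties, which the abstract does not enumerate. The function names `build_semantic_graph` and `triple_features` and the example triples are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def build_semantic_graph(triples):
    """Merge (subject, predicate, object) triples into a directed graph.

    Assumption: triples are already normalized, so equal strings denote
    the same entity. Duplicate triples are merged into a single edge.
    """
    edges = defaultdict(set)       # subject -> {(predicate, object), ...}
    in_degree = defaultdict(int)   # number of distinct edges entering a node
    out_degree = defaultdict(int)  # number of distinct edges leaving a node
    for subj, pred, obj in triples:
        if (pred, obj) in edges[subj]:
            continue  # identical triple seen before; keep one edge only
        edges[subj].add((pred, obj))
        out_degree[subj] += 1
        in_degree[obj] += 1
    return edges, in_degree, out_degree


def triple_features(triple, in_degree, out_degree):
    """Return simple topological attributes for one triple.

    These degree counts are illustrative placeholders for the
    'topological graph properties' mentioned in the abstract.
    """
    subj, _pred, obj = triple
    return {
        "subj_in": in_degree.get(subj, 0),
        "subj_out": out_degree.get(subj, 0),
        "obj_in": in_degree.get(obj, 0),
        "obj_out": out_degree.get(obj, 0),
    }


# Hypothetical, already-normalized triples from a three-sentence document.
example_triples = [
    ("storm", "hit", "coast"),
    ("storm", "damage", "house"),
    ("resident", "leave", "coast"),
    ("storm", "hit", "coast"),  # duplicate, merged into the existing edge
]

edges, in_deg, out_deg = build_semantic_graph(example_triples)
features = triple_features(("storm", "hit", "coast"), in_deg, out_deg)
```

Feature dictionaries of this kind, one per triple and labeled by whether the triple also occurs in the summary extract, would form the training set for a classifier such as the Support Vector Machine the abstract mentions.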
Similar resources
Impact of Linguistic Analysis on the Semantic Graph Coverage and Learning of Document Extracts
Automatic document summarization is a problem of creating a document surrogate that adequately represents the full document content. We aim at a summarization system that can replicate the quality of summaries created by humans. In this paper we investigate the machine learning method for extracting full sentences from documents based on the document semantic graph structure. In particular, we ...
Learning Semantic Sub-graphs for Document Summarization
In this paper we present a method for summarizing a document by creating a semantic graph of the original document and identifying the substructure of that graph that can be used to extract sentences for a document summary. We start with deep syntactic analysis of the text and, for each sentence, extract logical form triples, subject–predicate–object. We then apply cross-sentence pronoun resolu...
Ontology-Based Automatic Text Summarization Using FarsNet
To summarize a text means to compress the text source into a shorter text in a way that the informational content is kept the same. With regard to the irregular volume of information available on the internet, manual summarization of huge volume of information by humans will be very arduous and difficult. There have been many activities in the field of automatic summarization so far. However, a...
Concept-Graph Based Biomedical Automatic Summarization Using Ontologies
One of the main problems in research on automatic summarization is the inaccurate semantic interpretation of the source. Using specific domain knowledge can considerably alleviate the problem. In this paper, we introduce an ontology-based extractive method for summarization. It is based on mapping the text to concepts and representing the document and its sentences as graphs. We have applied ou...
The Effect of Semantic Mapping as a Vocabulary Instruction Technique on EFL Learners with Different Perceptual Learning Styles
Traditional and modern vocabulary instruction techniques have been introduced in the past few decades to improve the learners’ performance in reading comprehension. Semantic mapping, which entails drawing learners’ attention to the interrelationships among lexical items through graphic organizers, is claimed to enhance vocabulary learning significantly. However, whether this technique suits all...